The Complexity of Learning Halfspaces using Generalized Linear Methods
Authors
Abstract
Many popular learning algorithms (e.g., regression, Fourier-transform-based algorithms, kernel SVM, and kernel ridge regression) operate by reducing the problem to a convex optimization problem over a set of functions. These methods offer the currently best approach to several central problems, such as learning halfspaces and learning DNFs. In addition, they are widely used in numerous application domains. Despite their importance, there are still very few proof techniques to show limits on the power of these algorithms. We study the performance of this approach on the problem of (agnostically and improperly) learning halfspaces with margin γ. Let D be a distribution over labeled examples. The γ-margin error of a hyperplane h is the probability that an example falls on the wrong side of h or at a distance ≤ γ from it. The γ-margin error of the best h is denoted Err_γ(D). An α(γ)-approximation algorithm receives γ and ε as input and, using i.i.d. samples from D, outputs a classifier with error rate ≤ α(γ)·Err_γ(D) + ε. Such an algorithm is efficient if it uses poly(1/γ, 1/ε) samples and runs in time polynomial in the sample size. The best approximation ratio achievable by an efficient algorithm is O((1/γ) / √log(1/γ)), and it is achieved using an algorithm from the above class. Our main result shows that the approximation ratio of every efficient algorithm from this family must be ≥ Ω((1/γ) / poly(log(1/γ))), essentially matching the best known upper bound.
∗ Department of Mathematics, Hebrew University, Jerusalem 91904, Israel.
† School of Computer Science and Engineering, Hebrew University, Jerusalem 91904, Israel.
‡ School of Computer Science and Engineering, Hebrew University, Jerusalem 91904, Israel.
arXiv:1211.0616v4 [cs.LG] 10 May 2014
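As an illustration of the γ-margin error defined in the abstract, the following is a minimal sketch of its empirical counterpart on a finite sample. It is not the authors' code. It assumes a homogeneous hyperplane (through the origin) with normal vector w, labels in {-1, +1}, and NumPy; the function name margin_error and the toy data are hypothetical.

```python
import numpy as np

def margin_error(w, X, y, gamma):
    """Empirical gamma-margin error of the hyperplane {x : <w, x> = 0}.

    Assumptions (not from the source): the hyperplane passes through the
    origin, labels are in {-1, +1}, and w is normalized here so that
    <w, x> is the signed distance of x from the hyperplane.
    """
    w = np.asarray(w, dtype=float)
    w = w / np.linalg.norm(w)
    # y * <w, x> is negative on the wrong side, and its magnitude is the
    # distance to the hyperplane. An example counts as an error if it is
    # on the wrong side or within distance gamma of the hyperplane.
    signed_distance = np.asarray(y) * (np.asarray(X, dtype=float) @ w)
    return float(np.mean(signed_distance <= gamma))

# Hypothetical toy sample: four points in the plane.
X = np.array([[1.0, 0.0], [0.05, 0.0], [-1.0, 0.0], [-0.5, 0.5]])
y = np.array([1, 1, 1, -1])
print(margin_error([2.0, 0.0], X, y, gamma=0.1))
```

With gamma set to 0 the same function counts only the examples that lie on the wrong side of, or exactly on, the hyperplane, so the value can only grow as gamma increases.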
Similar resources
Active Learning of Halfspaces under a Margin Assumption
We derive and analyze a new, efficient, pool-based active learning algorithm for halfspaces, called ALuMA. Most previous algorithms show exponential improvement in the label complexity assuming that the distribution over the instance space is close to uniform. This assumption rarely holds in practical applications. Instead, we study the label complexity under a large-margin assumption—a much mo...
The regularized least squares algorithm and the problem of learning halfspaces
We provide the sample complexity of the problem of learning halfspaces with monotonic noise, using the regularized least squares algorithm.
Cryptographic Hardness Results for Learning Intersections of Halfspaces
We give the first representation-independent hardness results for PAC learning intersections of halfspaces, a central concept class in computational learning theory. Our hardness results are derived from two public-key cryptosystems due to Regev, which are based on the worst-case hardness of well-studied lattice problems. Specifically, we prove that a polynomial-time algorithm for PAC learning i...
Learning Functions of Halfspaces using Prefix Covers
We present a simple query algorithm for learning arbitrary functions of k halfspaces under any product distribution on the Boolean hypercube. Our algorithms learn any function of k halfspaces to within accuracy ε in time O((nk/ε)) under any product distribution on {0, 1}^n using read-once branching programs as a hypothesis. This gives the first poly(n, 1/ε) algorithm for learning even the inters...
Improved Lower Bounds for Learning Intersections of Halfspaces
We prove new lower bounds for learning intersections of halfspaces, one of the most important concept classes in computational learning theory. Our main result is that any statistical-query algorithm for learning the intersection of √n halfspaces in n dimensions must make 2^Ω(√n) queries. This is the first non-trivial lower bound on the statistical query dimension for this concept class (the pr...
Journal title:
Volume / issue:
Pages: -
Publication date: 2014